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ATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCA 

TACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGT 

MDKTHTC PPC PAPELLGGPS 

GTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTC 

61 + + + - - + + + 120 

C AGAAGGAGAAGGGGGGTTTTGGGTTC C TGTGGGAGT ACTAGAGGGC CTGGGGAC TCC AG 

VFLFPPKPKDTLMISRTPEV 

AC ATGCGTGGTGGTGGAC GTGAGCC AC GAAGACCC TGAGGTCAAGTTCAACTGGTAC GTG 

121 + + + + + + 180 

TGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCAC 

TCVVVDVSHEDPEVKFNWYV 

GACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACG 

181 + + + + + + 240 

CTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGC 

DGVEVHNAKTKPREEQYNST 

TACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTAC 

241 + + + + + + 300 

ATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATG 

Y RVVSVLTVLHQDWLNGKEY 

AAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCC 

301 + + + + + + 360 

TTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGG 

KCKVSNKALPAPIEKTISKA 

AAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACC 

361 + + + + + + 420 

TTTCCCGTC GGGGCTC TTGGTGTCC AC ATGTGGGAC GGGGGTAGGGCCC TAC TC GAC TGG 

KGQPREPQVYTL PPSRDELT 

AAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTG 

421 + + + + + + 480 

TTC TTGGTCC AGTCGGAC TGGAC GGACC AGTTTCCGAAGATAGGGTC GCTGTAGC GGC AC 

KNQVSLTC LVKGFYPSDIAV 

GAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGAC 

481 + + + + + + 540 

CTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTG 

EWESNGQPENNYKTTPPVLD 

TCCGACGGCTCC TTCTTCC TCTAC AGC AAGC TC ACCGTGGAC AAGAGC AGGTGGC AGC AG 

541 + + + + - - - + + 600 

AGGC TGCC GAGGAAGAAGGAG ATGTC GTTC GAGTGGC ACCTGTTCTCGTCC ACC GTC GTC 

SDGS FFLY S K'LTVDKSRWQQ 

GGGAAC GTCTTCTC ATGCTCC GTGATGG ATGAGGCTC TGC AC AACC AC TAC AC GC AGAAG 

601 + + + + + 660 

CCC TTGC AGAAGAGTAC GAGGC AC TAC GT ACTC CGAGAC GTGTTGGTGATGTGCGTCTTC 

GNVFSCSVMHEALHNHYTQK 

AGCCTCTCCCTGTCTCCGGGTAAA 

661 + + 684 

TC GGAGAGGGAC AGAGGCCC ATTT 
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tBu Pbf Boc 
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Wang resin 



H-IEGPTLRQWLAARA-GGG-HN CO-GGGG-IEGPTLRQWLAARA-QH 

peptide 18 

O 

MeO-i PEG 5CMMD 



pH8 



MeQ- | PEG SOdMD j -N 

° i 

H-IEGPTLRQWLAARA-GGG-HN CO-GGGG-IEGPTLRQWLAARA-OH 

peptide 20 



draft: fi \- 



r lb. 



SUBCLASS 



o 



o 



Xbal 




I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

♦ + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 

MDKTHTC P - 



CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

61 + + + + + + 120 

GTGG AAC AGGTCG AGGCCTTGAGGAC C CC CCTGGC AGTC AGAAGGAGAAGGGGGGTTTTG 
C PCPAPELLGGPSVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 
121 + + + + + + iso 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
c KDTLMI S RT PEVTCVVVDVS- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 

181 + + + + + + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
C HEDPEVKFNWYVDGVEVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 
241 + + + + + + 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
c KTK PREEQYNSTYRVVSVLT- 

C C GTC C TGC AC C AGG AC TGGC T G AATGGC AAGG AGT AC AAG TGC AAGGTC TC C AAC AAAG 

301 + + + + + + 360 

GGC AGGAC GTGGTCCTGACCGACTTACCGTTC CTC ATGTTCACGTTCC AGAGGTTGTTTC 
c VLHQDWLN GKEYKCKVSNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 + + + ♦ + + 420 

GGG AGGGTC GGGGGT AGC TC TTTTGGT AG AGGTTTC GGTTTC C C GTC GGGGC TC TT GGT G 
c LPAPIEKTI 3KAKGQPREPQ- 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 
421 ♦ + + + + + 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
c VYTL PPSRDELTKNQV3LTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 
481 + + + + + + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
c LVKGFYPSDIAVEWESNGQP- 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 + + + + + + 600 

GC C TCTTGTTGATGTTC TGGTGC GGAGGGC AC GAC C TG AGGC TGC C G AGG AAG AAGG AG A 
C ENNYKTTPPVLDSDGS FFLY- 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 ♦ + + + + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
C SKLTVDKSRWQQGNVFSCSV- 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 + + + + + + 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
c MHEALHNHYTQKSLSLSPGK- 

AAGGTGG AGGTGGTGGT ATC GAAGGTCC G AC TC TGC GTC AGTGGC TGGC TGC TC GTGCTT 

721 + + + + .'.v. + + 78O 

TTCCACCTCCACCACCATAGCTTCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGAA 
c GGGGGI EG PTLRQWLAARA*- 

BamHI 
I 

AATCTCGAGG ATC C 
781 + 794 



TTAGAGCTCCTAGG 



r !u. 



r v . pUSCLASS 

DRAFT 2! iA : i 



Xbal 




I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

1 + + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 
C MDKTHTCP- 

CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

61 + + + + + + 120 

GTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGCAGTCAGAAGGAGAAGGGGGGTTTTG 
c PCPAPELLGGPSVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 + + + + + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
c KDTLMI SRTPEVTC VVVDVS- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 

181 + + + + + + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
C HEDPEVKFNWYVDGVEVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 + + + + + + 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
C KTKPREEQYNSTYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 + + + + + + 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
c VLHQDWLNGKEYKCKV3NKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 + + + + + + 420 

GGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTT 
C LPAPIEKTISKAKGQPREPQ- 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 + + + + + + 480 

TCCACATGTGGGACGGOGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
c VYTLPPSRDELTKNQVSLTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 - + + + + + + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 
c LVKGFYPSDIAVEWESNGQP- 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 ♦ + + + + + 600 

GCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGA 
C ENNYKTT PPVLDSDGSFFLY- 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 + + + + + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
C SKLTVDKSRWQQGNVF3CSV- 

TGATGCATGAGGCTCTGCACAACGACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 + + + * ♦ ♦ 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
c MHEALHNHYTQKSLSLSPGK- 

AAGGTGGAGGTGGTGGTATCGAAGGTCCGACTCTGCGTCAGTGGCTGGCTGCTCGTGCTG 

721 + + + + + --- + 780 

TTCCACCTCCACCACCATAGCTTCCAGGCTGAGACGCAGTCACCGACCGACGAGCACGAC 
c GGGGGI EGPTLRQWLAARAG- 

GTGGTGGAGGTGGCGGCGGAGGTATTGAGGGCCCAACCCTTCGCCAATGGCTTGCAGCAC 

781 + + + + + + 840 

CACCACCTCCACCGCCGCCTCCATAACTCCCGGGTTGGGAAGCGGTTACCGAACGTCGTG 
C GGGGGGG I EG PTLRQWLAAR- 

BamHI 
I 

GCGCATAATCTCGAGGATCCG 

841 + + - 861 

CGCGTATTAGAGCTCCTAGGC 

c A * - 



DRAFT;? 
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5S 


SUBCLASS 



Xbal 
I 

TC TAGATTTGTTTT AAC T AATTAAAGGAGGAATAACAT ATGATCGAAGGTCCGACTCTGC 

1 ♦ + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACTAGCTTCCAGGCTGAGACG 

MIEGPTLR- 



GTCAGTGGCTGGCTGCTCGTGCTGGCGGTGGTGGCGGAGGGGGTGGCATTGAGGGCCCAA 

61 + + - - + + + + 120 

CAGTCACCGACCGACGAGCACGACCGCCACCACCGCCTCCCCCACCGTAACTCCCGGGTT 
c QWLAARAGGGGGGGGIEGPT- 

C C C TTC GCC AATGGCTTGC AGC ACGCGC AGGGGG AGGC GGTGGGG AC AAAAC TC AC AC AT 

121 + + + + + + 180 

GGGAAGCGGTTACCGAACGTCGTGCGCGTCCCCCTCCGCCACCCCTGTTTTGAGTGTGTA 
C LRQWLAARAGGGGGDKTHTC- 

GTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAA 

181 + + + ♦ + + 240 

CAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGCAGTCAAAAGGAGAAGGGGGGTT 
c PPCPAPEL LGGPSVFLFPPK- 

AACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACG 

241 + + + + + + 300 

TTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGC 
C PKDTLMISRTPEVTCVVVDV- 

TGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATA 

301 + + + + + + 360 

ACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTAT 
c SHEDPBVKFNWYVDGVEVHN- 

ATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCC 

361 + + + + + + 420 

TACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGG 
c AKTKPREEQYNSTYRVV SVL- 

TCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACA 

421 + + + + + + 480 

AGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGT 
C TVLHQDWLNGKEYKC KVSNK- 

AAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAAC 

481 + + + + + + 540 

TTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTTCGGTTTCCCG 
c ALPAPIEKTI SKAKGQ PRE P- 

CACAGGTGTACACCCTGCC<;CCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGA 

541 + + + + + + 600 

GTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACT 
C QVYTLPPSRDELTKNQVSLT- 

C C TGC CTGGTC AAAGGCTTC T ATC C C AGC G AC ATC GC C GTGGAGTGGG AG AGC AATGGGC 

601 + + + + + + 660 

GGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCG 
c C LVKGFY PSDIAVEWESNGQ- 

AGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCC 

661 + + + + + + 720 

TCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGG 
C PENNYKTT P PVLD SDGS FFL* 



TCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCT 

721 + + + + + + 780 

AGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGA 
c Y S KLTVDKS RWQQGNVFSC S- 

CCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGG 

781 + + + + + + 840 

GGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCC 
c VMHEALHNHYTQKSLSLSPG- 



BamHI 
I 

GTAAATAATGGATCC 

841 + 855 

CATTTATTACCTAGG 
C K * 



F ! n 



0RAF7: 



^1 jSUSCLASS 
I 



Xbal 

I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGATCGAAGGTCCGACTCTGC 

1 + + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACTAGCTTCCAGGCTGAGACG 

MIEGPTLR- 

GTCAGTGGCTGGCTGCTCGTGCTGGTGGAGGCGGTGGGGACAAAACTCACACATGTCCAC 

61 + + + + + + 120 

CAGTCACCGACCGACGAGCACGACCACCTCCGCCACCCCTGTTTTGAGTGTGTACAGGTG 
QWLAARAGGGGGDKTHTCPP- 

CTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCA 

121 + + + + + + 180 

GAAC GGGTCGTGGACTTGAGGACCCCC CTGGC AGTC AAAAGG AGAAGGGGGGTTTTGGGT 
C PAPELLGGPSVFLFPPKPK- 

AGGAC ACCCTC ATGATCTCC CGGACC C CTGAGGTC AC ATGCGTGGTGGTGGACGTGAGCC 

181 + + - + + + + 240 

TCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGG 
DTLMI SRTPEVTCVVVDVSH- 

ACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCA 

241 + + + + + + 300 

TGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTACGGT 
ED PEVKFNWYVDGVEVHNAK- 

AGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCG 

301 + «■ + + + + 360 

TCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGC 
TKPREEQYNSTYRVVSVLTV- 

TCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCC 

361 - - + + + + + 4- 420 

AGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGG 
LHQDWLNGKEYKCKVSNKAL- 

TCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGG 

421 + + + + + + 480 

AGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCC 
PAPIEKTI SKARGQPREPQV- 

TGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCC 

481 + + + + + + 540 

ACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGG 
YTL PPSRDELTKNQVS LTC L - 

TGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGG 

541 + + + + + + 600 

ACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCGGCC 
VKGFY PSD IAVEWE SNGQ PE- 

AGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACA 

601 + + + + + + 660 

TCTTGTTG ATGTTCTGGTGC GG AGGGC AC G AC C TGAGGC TGC C G AGG AAG AAGG AG ATGT 
NNYKTTPPVLDSDGSFFLYS- 

GC AAGC TC AC C GTGG AC AAG AGC A GGTGGC AGC AGGGG AAC GTC TTCTC ATGC TC C GTG A 

661 + + + + + + 720 

CGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACT 
KLTVDKSRWQQGNVFSCSVM- 

TGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAAT 

721 + + + + + + 780 

ACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTA 
HEALHNHYTQKSLSLS PGR*- 

BamHI 
I 

AATGGATCC 

781 789 

TTACCTAGG 
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FIG. 13 



TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

1 + + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTGTGTACAG 

MDKTHTC P - 
CACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCGTCAGTCTTCCTCTTCCCCCCAAAAC 

61 + + + + + + 120 

GT GG AAC AGGTC G AGGC C TTG AGGAC C C C C C T GGC AGTC AG AAGG AG AAGGGGGG TTTTG 
PCPAPELLGGPSVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 + + + + + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
KDT LM I S R T PEVTC VVVDV S - 

GC C AC G AAG AC C C TG AGGTC AAGTTC AAC TGGT AC GTGG AC GGC GTGG AGGTGC AT AATG 

181 + + + + + + 240 

CGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGCACCTCCACGTATTAC 
HED PEVKFNWYV DGVEVHNA- 

CCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCA 

241 + + + + + + 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
KTKPREEQYN STYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 + + + + + + 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
VLHQDWLNGKEYRC KVSNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 + + + + + + 420 

GGG AGGGTC GGGGGT AGC TCTTTTGGT AGAGGTTTC GGTTTC C C GTQGGGGC TCTTGGTG 
LPA PIEKTI SKAKGQPRE PQ* 

AGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 - + + + + + + 480 

TCCACATGTGGGACGGGGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
VYTL PPSRDELTKNQVSLTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 - + + + + + + 540 

CGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCG 

LVKGFYPSDIAVEWESNGQP- 
CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 + + + + + + 600 

GC C TC TTGTTGATGTTC TGGTGC GG AGGGC AC G AC C TG AGGC TGC C G AGG AAG AAGG AG A 
ENNYKTTPPVLDSDGSFFLY- 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 + + + + + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
SKLTVDKSRWQQGNVFSCSV- 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 + + + + + + 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
MHEALHNHYTQKS LSLS PGK- 

AAGGTGG AGGTGGTGGTGG AGGT AC TT AC TCTTGC C AC TTC GGC C C GCTGAC TTGGGTTT 

721 + + + + + + 780 

TTCCACCTCCACCACCACCTCCATGAATGAGAACGGTGAAGCCGGGCGACTGAACCCAAA 
GGGGGGGTY SC H'FG PLTWVC- 

BamHI 
I 

GCAAACCGCAGGGTGGTTAATCTCGTGGATCC 

781 + + + -- 812 

CGTTTGGCGTCCCACCAATTAGAGCACCTAGG 
K P Q G G * 
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FIG. 14 



TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGGAGGTACTTACTCTTGCC 

1 + + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCCTCCATGAATGAGAACGG 
c MGGTYSCH- 

ACTTCGGCCCGCTGACTTGGGTATGTAAGCCACAAGGGGGTGGGGGAGGCGGGGGGGACA 

61 + + + + + + 120 

TGAAGCCGGGCGACTGAACCCATACATTCGGTGTTCCCCCACCCCCTCCGCCCCCCCTGT 
C FG PLTWVC KPQGGGGGGGDK- 

AAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCC 

121 -- + + + + + + 180 

TTTG AGTGTGT AC AGGTGG AAC GGGTC GTGG ACTTG AGG AC C C C C C TGGC AGTC AAAAGG 
C THTCPPCPAPELLGGPSVFL- 

TCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCG 

181 + + + + + + 240 

AGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGC 
C FPPKPKDTLMI SRTPEVTCV- 

TGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCG 

241 + + + + + + 300 

ACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATGCACCTGCCGC 
C VVDV S HED PBVK FNWYVDGV- 

TGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGCACGTACCGTG 

301 + + + + + + 360 

ACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCAC 
c EVHNAKTKPREEQYNSTYRV- 

TGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCA 

361 ♦ - - + + + + + 420 

ACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGT 
c VSVLTVLHQDWLNGKEY KCK- 

AGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGC 

421 + + + + + + 480 

TCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTTCGGTTTCCCG 
C VSNKALPA PI EKT I SKAKGQ- 

AGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTGACCAAGAACC 

481 + + + + + + 540 

TCGGGGCTCTTGGTGTCCACATGTGG^ACGGGGGTAGGGCCCTACTCGACTGGTTCTTGG 
C PRE PQVYTLPPS RDELTKNQ- 

AGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGG 

541 + + + + + + 600 

TCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGGCACCTCACCC 
C VSLTCLVKGFYPSDIAVEWE- 

AGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACG 

601 + + + + + + 660 

TCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGACCTGAGGCTGC 
C SNGQ PENNYKTT P PVLDSDG- 

GCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACG 

661 + + + ♦ + + ? 20 

CGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGC 
c 3 F F L Y 3 K L TV D K,S RWQ QG N V - 

TCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCT 

721 + + -.-- + + + + 780 

AGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGA 
c FSCSVMHEA- LHNHYTQKSLS- 

BamHI 
I 

CCCTGTCTCCGGGTAAATAATGGATCC 

781 ♦ + 807 

GGGACAGAGGCCCATTTATTACCTAGG 
c L S P G K * 
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Xbal Fl 

I 

TCTAGATTTGAGTTTTAACTTTTAGAAGGAGGAATAAAATATGGGAGGTACTTACTCTTG 

1 + + + + + + 60 

AGATCTAAACTCAAAATTGAAAATCTTCCTCCTTATTTTATACCCTCCATGAATGAGAAC 

M G G T Y S C - 

CCACTTCGGCCCACTGACTTGGGTTTGCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGG 

61 + + + + + + 120 

GGTGAAGCCGGGTGACTGAACCCAAACGTTTGGCGTCCCACCGCCGCCGCCGCCGCCACC 
HFGPLTWVCKPQGGGGGGGG 

TACCTATTCCTGTCATTTTGGCCCOCTGACCTGGGTATGTAAGCCACAAGGGGGTGGGGG 

121 + + + + + + 180 

ATGGATAAGGACAGTAAAACCGGGCGACTGGACCCATACATTCGGTGTTCCCCCACCCCC 
TYSCHFGPLTWVCKPQGGGG 

AGGCGGGGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGG 

181 + + + + ♦ ♦ 240 

TCCGCCCCCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCC 
GGGDKTHTCPPC PAPELLGG 

ACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCC 

241 + + + + + -+ 300 

TGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGG 
PSVFLFPPKPKDTLMISRTP- 

TG AGGTC AC ATGC GTGGTGGTGG AC GTG AGC C AC GAAGAC C C TG AGGTC AAGTTC AACTG 

301 + + + + + + 360 

ACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGAC 
EVTCVVVDVSHEDPEVKFNW- 

GTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAA 

361 + + + + + + 420 

CATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTT 
YVDGVEVHNAKTKPREEQYN- 

CAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAA 

421 + + + + + + 480 

GTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTT 
STYRVVSVLTVLHQDWLNGK 

GGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTC 

481 + + + + + + 540 

CCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAG 
EYKCKVSNKALPAPIEKTIS - 

CAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGA 

541 + + + + + + 600 

GTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACT 
KAKGQPREPQVYTLPPSRDE 

GCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACAT 

601 + + + + + + 660 

CGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTA 
LT KNQV S LTC LVKGFY P S D I - 

CGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGT 

661 + + + + + + 720 

GCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCA 
AVEWESNGQ PENNYKTTPPV - 

GCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTG 

721 + + + + + + 780 

CGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCAC 
LD S DGS FFLYS KL.TVDKS RW 

GCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACAC 

781 + + + + + + 840 

CGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTG 
QQGNVFSCSVMHE ALHNHYT- 

BamHI 
I 

GCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 

841 + + + - ♦ - 881 

C GTCTTCTCGGAGAGGGAC AGAGGCCC ATTT ATT ACCT AGG 
QKSLSLSPGK* 
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FIG. 16 

I 

TCTAGATTTGTTTTAACTAATTAAAGGAGGAATAACATATGGACAAAACTCACACATGTC 

1 + + + + + + 60 

AGATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCTGTTTTGAGTC 
C MDKTHTC P - 

CACCTTGCCCAGCACCTGAACTCCTGGGGGGACCGTCAGTTTTCCTCTC 

*1 + + + + + + 120 

GTGGAAC GGGTC GTGGACTTGAGGAC C CCC C TGGC AGTC AAAAGGAGAAGGGGGGTTTTG 
C PCPAPELLGGPSVFLFPPKP- 

CCAAGGACACCCTCATGATCTCCCGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGA 

121 + + + + + + 180 

GGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTCCAGTGTACGCACCACCACCTGCACT 
C KDTLMISRTPEVTCVVVDVS- 

GCCACGAAGACCCTGAGGTCAAGTTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATG 

181 + + + + + + 240 

C GGTGC TTC TGGG AC TC C AG TT C AAGTTGAC C ATGC AC C TGC C GC AC C TC C AC GT ATT AC 
C HEDPEVKFNWYVDGVEVHNA- 

C C AAGAC AAAGC C GC GGGAGG AGC AGT AC AAC AGC AC GT ACCGTGTGGTC AGC GTC C TC A 

241 + + + + + + 300 

GGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGT 
c RTKPRBEQYNSTYRVVSVLT- 

CCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAG 

301 + + - + + + 360 

GGCAGGACGTGGTCCTGACCGACTTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTC 
C VLHQDWLNGKEYKCKVSNKA- 

CCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCAC 

361 + + + + + + 420 

GGGAGGGTC GGGGGT AGCTC TTTTGGTAGAGGTTTC GGTTTCCC GTCGGGGCTC TTGGTG 
C LPAPIEKTISKAKGQPREP Q- 

AGGTGTACACCCTGCCTCCATCCCGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCT 

421 + + + + + + 480 

TCCACATGTGGGACGGAGGTAGGGCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGA 
c VYTL P PS RDELTKNQVS LTC- 

GCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGC 

481 + + + + + + 540 

C GGAC C AGTTTC C GAAGAT AGGGTC GC T GT AGC GGC AC C TC AC C CTCTC GTT AC C C GTC G 
C LVKGFY PSDIAVEWESNGQP- 

CGGAGAACAACTACAAGACCACGCCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCT 

541 + + + + + + 600 

GC C TCTTGTTG ATGTTCTGGTGC GG AGGGC AC G AC CTG AGGCTGC C GAGG AAGAAGG AG A 
c ENNYKTT PPVLD3DGSFFLY- 

ACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCG 

601 + + + + + + 660 

TGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGC 
C SKLTVDKSRWQQGNVFSCSV- 

TGATGCATGAGGCTCTGCACAACCACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTA 

661 + + + + + + 720 

ACTACGTACTCCGAGACGTGTTGGTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCAT 
c MHEALHNHYTQKS LSLS PGK- 

AAGGTGGAGGTGGTGGCGGAGGTACTTACTCTTGCCACTTCGGCCCACTGACTTGGGTTT 

721 + + + + + + 780 

TTCCACCTCCACCACCGCCTCCATGAATGAGAACGGTGAAGCCGGGTGACTGAACCCAAA 
c GGGGGGGTY SCHFGPLTWVC* 

GCAAACCGCAGGGTGGCGGCGGCGGCGGCGGTGGTACCTATTCCTGTCATTTTGGCCCGC 

781 + + + + + + 840 

C GTTTGGC GTC C C ACC GC C GC C GC C GC C GC C AC C ATGG AT AAGG AC AGT AAAAC C GGGC G 
C KPQGGGGGGGGTY SCHFG PL- 

BamHI 
I 

TG AC C TGGGT ATGT AAGCC AC AAGGGGGTT AATC TC G AGG ATC C 

841 + + + + 884 

ACTGGACCCATACATTCGGTGTTCCCCCAATTAGAGCTCCTAGG 
C TWVCKPQGG* 




FIG. 17A 

tA&tll sticky end] 5* GCGTAACGTATGCATGGTCTCC - 

(position #4 358 in pAMG21) 3* TGCACGCATTGCATACGTACC AGAGG - 

- CC ATGCGAGAGTAGGGAACTGCCAGGCATCAAATAAAACGAAAGGCTC AGTCGAAAGACT - 

- GGTACGCTCTCATCCCTTGACGGTCCGTAGTTTATTTTGCTTTCCGAGTCAGCTTTCTGA - 

- GGGCCTTTCGTTTTATCTGTTGTTTGTCGGTGAACGCTCTCCTGAGTAGGACAAATCCGC - 

- CCCGGAAAGCAAAATAGACAACAAACAGCCACTTGCGAGAGGACTCATCCTGTTTAGGCG - 

- C GGGAGC GGATTTGAAC GTTGC GAAGC AAC GGCCC GGAGGGTGGCGGGC AGGACGCCC GC - 

- GCCCTCGCCTAAACTTGCAACGCTTCGTTGCCGGGCCTCCC ACCGCCCGTCCTGCGGGCG - 

- C ATAAACTGCCAGGCATCAAATTAAGCAGAAGGCC ATCCTGACGGATGGCCTTTTTGCGT - 

- GTATTTGACGGTCCGTAGTTTAATTCGTCTTCCGGTAGGACTGCCTACCGGAAAAACGCA - 

Aftt ll 

- TTC T AC AAAC TC TTTTGTTT ATTTTTC T AAAT AC ATTC AAAT ATGGAC GTC GT AC TT AAC - 
" AAGATGTTTGAGAAAACAAATAAAAAGATTTATGTAAGTTTATACCTGCAGCATGAATTG - 

- TTTTAAAGTATGGGCAATCAATTGCTCCTGTTAAAATTGCTTTAGAAATACTTTGGCAGC - 

- AAAATTTCATACCCGTTAGTTAACGAGGACAATTTTAACGAAATCTTTATGAAACCGTCG - 

- GGTTTGTTGTATTGAGTTTCATTTGCGCATTGGTTAAATGGAAAGTGACCGTGCGCTTAC - 

- CCAAACAACATAACTCAAAGTAAACGCGTAACCAATTTACCTTTCACTGGCACGCGAATG - 

- TACAGCCTAATATTTTTGAAATATCCCAAGAGCTTTTTCCTTCGCATGCCCACGCTAAAC - 

- ATGTCGGATTATAAAAACTTTATAGGGTTCTCGAAAAAGGAAGCGTACGGGTGCGATTTG - 

- ATTCTTTTTCTCTTTTGGTTAAATCGTTGTTTGATTTATTATTTGCTATATTTATTTTTC - 

- T AAGAAAAAGAGAAAACCAATTTAGCAACAAACTAAATAAT AAAC GAT ATAAATAAAAAG - 

- GATAATTATCAACTAGAGAAGGAACAATTAATGGTATGTTCATACACGCATGTAAAAATA - 

- CTATTAATAGTTGATCTCTTCCTTGTTAATTACCATACAAGTATGTGCGTACATTTTTAT - 

- AACTATCTATATAGTTGTCTTTCTCTGAATGTGCAAAACTAAGCATTCCGAAGCCATTAT - 

- TTGATAGATATATCAACAGAAAGAGACTTACACGTTTTGATTCGTAAGGCTTCGGTAATA - 

- T AGC AGT ATG AAT AGGG AAAC TAAAC C C AGTGATAAG ACC TG ATG ATTTC GC TTC TTT AA - 

- ATCGTCATACTTATCCCTTTGATTTGGGTCACTATTCTGGACTACTAAAGCGAAGAAATT - 

- TTACATTTGGAGATTTTTTATTTACAGCATTGTTTTCAAATATATTCCAATTAATCGGTG - 

- AATGTAAACCTCTAAAAAATAAATGTCGTAACAAAAGTTTATATAAGGTTAATTAGCCAC - 

- AATGATTGGAGTTAGAATAATCTACTATAGGATCATATTTTATTAAATTAGCGTC ATCAT - 

- TTACTAACCTCAATCTTATTAGATGATATCCTAGTATAAAATAATTTAATCGCAGTAGTA - 

- AATATTGCCTCCATTTTTTAGGGTAATTATCCAGAATTGAAATATCAGATTTAACCATAG - 
_ TTATAACGGAGGTAAAAAATCCCATTAATAGGTCTTAACTTTATAGTCTAAATTGGTATC - 

- AATGAGGATAAATGATCGCGAGTAAATAATATTCACAATGTACCATTTTAGTCATATCAG - 

- TTACTCCTATTTACTAGCGCTCATTTATTATAAGTGTTACATGGTAAAATCAGTATAGTC - 

- ATAAGCATTGATTAATATCATTATTGCTTCTACAGGCTTTAATTTTATTAATTATTCTGT - 

- TATTCGTAACTAATTATAGTAATAACGAAGATGTCCGAAATTAAAATAATTAATAAGACA - 

- AAGTGTCGTCGGCATTTATGTCTTTCATACCCATCTCTTTATCCTTACCTATTGTTTGTC - 
• TTC AC AGC AGC C GT AAAT AC AGAAAGT ATGGGT AG AGAAAT AGG AATGGAT AAC AAAC AG - 

- GC AAGTTTTGCGTGTTATATATCATTAAAACGGTAATAGATTGACATTTGATTCTAATAA - 

- CGTTCAAAACGCACAATATATAGTAATTTTGCCATTATCTAACTGTAAACTAAGATTATT - 



-ATTGGATTTTTGTCACACTATTATATCGCTTGAAATACAATTGTTTAACATAAGTACCTG 
- T AAC C T AAAAAC AGTGTG AT AAT AT AGC GAAC TTT ATGTTAAC AAATTGT ATTC ATGG AC 

-TAGGATCGTACAGGTTTACGCAAGAAAATGGTTTGTTATAGTCGATTAATCGATTTGATT 
- ATCCTAGCATGTCCAAATGCGTTCTTTTACCAAACAATATCAGCTAATTAGCTAAACTAA 

- C T AGATTTGTTTT AACTAATTAAAGGAGGAAT AAC AT ATGGTT AAC GC GTTGGAATTC GA 
-GATCTAAACAAAATTGATTAATTTCCTCCTTATTGTATACCAATTGCGCAACCTTAAGCT 

Sac I i 

-GCTCACTAGTGTCGACCTGCAGGGTACCATGGAAGCTTACTCGAGGATCCGCGGAAAGAA 
-CGAGTGATCACAGCTGGACGTCCCATGGTACCTTCGAATGAGCTCCTAGGCGCCTTTCTT 

-GAAGAAGAAGAAGAAAGCCCGAAAGGAAGCTGAGTTGGCTGCTGCCACCGCTGAGCAATA 
-CTTCTTCTTCTTCTTTCGGGCTTTCCTTCGACTCAACCGACGACGGTGGCGACTCGTTAT 

-ACTAGCATAACCCCTTGGGGCCTCTAAACGGGTCTTGAGGGGTTTTTTGCTGAAAGGAGG 
-TGATCGTATTGGGGAACCCCGGAGATTTGCCCAGAACTCCCCAAAAAACGACTTTCCTCC 

AACCGCTCTTCACGCTCTTCACGC 3' (S&£ll sticky end] 

TTGGCGAGAAGTGCGAGAAGTG 5' (position #5904 in pAMG21) 
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CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 

1 + + + + + + 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPELLGGP 

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 + + + + + + 120 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 + + + + + + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 + + + + + + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 + + + + + + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + + + + + + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

361 + + + + + + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 + + + + + + 480 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLTCLVKGFYPSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 + + + + - - - + + 540 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWE SNG QPENN YKTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

541 + + + + + + 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGSFFLYSKLTVDKSRWQ 
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FIG. 191 



CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

601 + + + + + + 660 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTGGTGACTTCCTGCCGCACTAC 

661 + + + + + + 720 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCTCCACCACCACTGAAGGACGGCGTGATG 

KSLSLSPGKGGGGGDFLPHY 

BamHI 
I 

AAAAACACCTCTCTGGGTCACCGTCCGTAATGGATCC 

721 + + + 757 

TTTTTGTGGAGAGACCCAGTGGCAGGCATTACCTAGG 

KNTSLGHRP* 
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FIG. 20A 



61 



Ndel 
I 

CATATGGACTTCCTGCCGCACTACAAAAACACCTCTCTGGGTCACCGTCCGGGTGGAGGC 

+ + + + + + 

GTATACCTGAAGGACGGCGTGATGTTTTTGTGGAGAGACCCAGTGGCAGGCCCACCTCCG 

MDFLPHYKNTSLGHRPGGG 

GGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCG 

+ + + + + + 

CCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGC 

GGDKTHTCPPCPAPELLGGP 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 



60 



120 



121 



■+ 180 



AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 



181 



•+ 240 



CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 



241 



■+ 300 



CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 
VDGVEVHN AKTKPREEQYNS 
ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 



301 



■+ 360 



TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 



361 



•+ 420 



421 



ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

+ + + + + + 480 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKG QPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 



481 



TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 
TKNQVSLTCLVK GFYPSDIA 
GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 



•+ 540 



541 



CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 
VEWE SNGQPENNYKTTPPVL 



■+ 600 



SUBCLASS 



DRAFTS'" ' 



O 



o 



601 



661 



GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

+ + + + -f + 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGS FFLYSKLTVDKSRWQ 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 
+ + + + + + 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 
QGNVF SCSVMHEALHNHYTQ 



660 



720 



BamHI 

I 

AAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCCGCGG 

721 + + + + - 761 

TTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGGCGCC 



i i u . 



f-G iSUBCLASS 



DRAF 



o 



o 



Ndel 
I 

CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 

1 + + + + + + 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPELLGGP 

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 + + + + + + 120 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 + + + - + + + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 + + + + + + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 + + + + + + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + + + - + + + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

361 + + + + + + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQV. YTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 + - - + + + + + 480 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLTCLVKGFYPS DIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 + + + + + + 540 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPEN. NYKT TPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

541 + + + + + + 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

D S D G S F F L*Y S K LTVDK S RWQ 



o o 



LASS 



F! 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

01 + + + + + + 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 
QGNVF SCSVMHEALHNHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTGGTTTCGAATGGACCCCGGGT 

61 + + + + + + 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCTCCACCACCAAAGCTTACCTGGGGCCCA 

KSLSLSPGKGGGGGFEWTPG 

BamHI 
I 

TACTGGCAGCCGTACGCTCTGCCGCTGTAATGGATCCCTCGAG 

121 + + + + "* 763 

ATGACCGTCGGCATGCGAGACGGCGACATTACCTAGGGAGCTC 

YWQPYALPL* 



TIsuSCLASS 



DRAF15* 



O 



O 



F 



Ndel 



CATATGTTCGAATGGACCCCGGGTTACTGGCAGCCGTACGCTCTGCCGCTGGGTGGAGGC 

1 + + - + + + + 60 

GTATACAAGCTTACCTGGGGCCCAATGACCGTCGGCATGCGAGACGGCGACCCACCTCCG 

MFEWTPGYWQPYALPLGGG 

GGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGGGGACCG 

61 + + + + + + 120 

CCACCCCTGTTTTGAGTGTGTACAGGTGGAACGGGTCGTGGACTTGAGGACCCCCCTGGC 

GGDKTHTCPPCPAPELLGGP 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

121 + + + + + + 180 

AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

181 + + + + + + 240 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPE V KFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

241 + + + + + + 300 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

301 + + + + + + 360 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDW LNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

361 + + + + + + 420 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

421 + + + + + + 480 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

481 + + + ---- + -- + + 540 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

T KNQVS LT'C LVK G F Y PSD I A 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

541 + + + + + + 600 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 



o 



o 



ASS 



F! 




GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 
01 + + + + + + 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 
DSDGSFFLYSKLTVDKSRWQ 

CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 
61 + + + + + + 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 
QGNVFSCSVMHEALHNHYTQ 

BamHI 



1 + + + 757 

TTCTCGGAGAGGGACAGAGGCCCATTTATTACCTAGG 

KSLSLSPGK* 



I ■ . -:,S 1 SUBCLASS 

DRAFTS^ ~ ; I 



o o 



FIG. 23A 



Ndel 
I 

CATATGGACAAAACTCACACATGTCCACCGTGCCCAGCACCTGAACTCCTGGGGGGACCG 

1 + + + + + + 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGCACGGGTCGTGGACTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPELLGGP 

TCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 + + + + + + 120 

AGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMISRTPE, 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 + + + + + + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 + + + + + + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATGTTGTCG 

VDGVEVHNAKTKP. REEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 + + + + + + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQDWLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + + + + + + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPI EKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

361 + + + + + + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 + + + + + + 480 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLTCLVKGFYPSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 + + + + + + 540 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

541 + + + + + + 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGS FFLYSKLTVDKSRWQ 



* SUBCLASS 



D3AFTVV.- > 



o 



o 



F 



CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 
601 + + + + + + 660 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGTGGTGGTGGTGTTGAACCGAACTGTGAC 

661 + + + + + + 720 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCACCACCACCACAACTTGGCTTGACACTG 

KSLSLSPGKGGGGGVEPNCD 

BamHI 
I 

ATCCATGTTATGTGGGAATGGGAATGTTTTGAACGTCTGTAACTCGAGGATCC 

721 + + + + + 773 

TAGGTACAATACACCCTTACCCTTACAAAACTTGGAGACATTGAGCTCCTAGG 

IHVMWEWECFERL* 



DRAFT:! 



: i U . 

I SUBCLASS 
I 

i 



o o 



F 



Ndel 
I 

CATATGGTTGAACCGAACTGTGACATCCATGTTATGTGGGAATGGGAATGTTTTGAACGT 

1 + + + + + + 60 

GTATACCAACTTGGCTTGACACTGTAGGTACAATACACCCTTACCCTTACAAAACTTGCA 

MVEPNCDIHVMWEWECFER 

CTGGGTGGTGGTGGTGGTGACAAAACTCACACATGTCCACCGTGCCCAGCACCTGAACTC 

61 + + + + + + 120 

GACCCACCACCACCACCACTGTTTTGAGTGTGTACAGGTGGCACGGGTCGTGGACTTGAG 

LG GGGGD KTHTC P PC PA P E L 

CTGGGGGGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCC 

121 + + + + + + 180 

GA^CCCCCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGG 

LGGPSVFLFPPKPKDTLMIS 

CGGACCCCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAG 

181 + + + + + + 240 

GCCTGGGGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTC 

RTPEVTCVVVDVSHEDPEVK 

TTCAACTGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAG 

241 + + + + + + 300 

AAGTTGACCATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTC 

FNWYVDGVEVHNAKTKPREE 

CAGTACAACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTG 

301 + + + + + + 360 

GTCATGTTGTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGAC 

QYNSTYRVVSVLTVLHQDWL 

AATGGCAAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAA 

361 + + + + + + 420 

TTACCGTTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTT 

NGKEYKCKVSNKALPAPIEK 

ACCATCTCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCC 

421 + + + + + + 480 

TGGTAGAGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGG 

T ISKAKGQPREPQVYTLPPS 

CGGGATGAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCC 

481 + + + + + + 540 

GCCCTACTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGG 

RDELTKNQVSLTCLVKGFYP 

AGCGACATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACG 

541 + + + + + + 600 

TCGCTGTAGCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGC 

SDIAVEWESNGQPENNYKTT 



o 



o 



1 SUBCLASS 



DRAFT ■ 



FIG. 241 



CCTCCCGTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAG 

601 + + + + + + 660 

GGAGGGCACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTC 

PPVLDSDGSFFLYSK LTVDK 

AGCAGGTGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAAC 

661 + + + + + + 720 

TCGTCCACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTG 

SRWQQGNVFSCSVMHEALHN 

BamHI 
I 

CACTACACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAACTCGAGGATCC 

721 + + + + + 773 

GTGATGTGCGTCTTCTCGGAGAGGGACAGAGGCCCATTTATTGAGCTCCTAGG 



HYTQKSLSLSPGK 




FIG. 25A 



Ndel 



CATATGGACAAAACTCACACATGTCCACCTTGTCCAGCTCCGGAACTCCTGGGGGGACCG 

1 + + + + + + 60 

GTATACCTGTTTTGAGTGTGTACAGGTGGAACAGGTCGAGGCCTTGAGGACCCCCCTGGC 

MDKTHTCPPCPAPELLGGP 

TCAGTCTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACCCCTGAG 

61 + + + + + + 120 

AGTCAGAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGGGGACTC 

SVFLFPPKPKDTLMI SRTPE 

GTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAACTGGTAC 

121 + + + + + + 180 

CAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTGACCATG 

VTCVVVDVSHEDPEVKFNWY 

GTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTACAACAGC 

181 + + + + + + 240 

CACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATfiTTGTCG 

VDGVEVHNAKTKPREEQYNS 

ACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGCAAGGAG 

241 + + + + + + 300 

TGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCGTTCCTC 

TYRVVSVLTVLHQD WLNGKE 

TACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATCTCCAAA 

301 + + + + + + 360 

ATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAGAGGTTT 

YKCKVSNKALPAPIEKTISK 

GCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGATGAGCTG 

361 + + + + + + 420 

CGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTACTCGAC 

AKGQPREPQVYTLPPSRDEL 

ACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGACATCGCC 

421 + + + + + + 480 

TGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTGTAGCGG 

TKNQVSLTCLVKGFYPSDIA 

GTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCCGTGCTG 

481 + + + - + + + 540 

CACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGGCACGAC 

VEWESNGQPENNYKTTPPVL 

GACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGGTGGCAG 

541 + - - - + + + + + 600 

CTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCCACCGTC 

DSDGS FFL YS KLTVDKSRWQ 



5 jSUBCLASS 

I 



o 



o 



F 



CAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTACACGCAG 

601 + + - + + + + 660 

GTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATGTGCGTC 

QGNVFSCSVMHEALHNHYTQ 

AAGAGCCTCTCCCTGTCTCCGGGTAAAGGTGGAGGTGGTGGTTGCACCACCCACTGGGGT 

661 + + + + + + 720 

TTCTCGGAGAGGGACAGAGGCCCATTTCCACCTCCACCACCAACGTGGTGGGTGACCCCA 

KSLSLSPGK-GGGGGCTTHWG 

BamHI 
I 

TTCACCCTGTGCTAATGGATCCCTCGAG 

721 + + 748 

AAGTGGGACACGATTACCTAGGGAGCTC 

F T L C * 



i;S iSUSCLASS 



DRAFTS-- 



O 



G 



F 



Ndel 
I 

CATATGTGCACCACCCACTGGGGTTTCACCCTGTGCGGTGGAGGCGGTGGGGACAAAGGT 



+ 60 



GTATACACGTGGTGGGTGACCCCAAAGTGGGACACGCCACCTCCGCCACCCCTGTTTCCA 

MCTTHWGFTLCGGGGGDKG 
GGAGGCGGTGGGGACAAAACTCACACATGTCCACCTTGCCCAGCACCTGAACTCCTGGGG 



61 



■+ 120 



121 



C C TC C GC C AC C C C TGTTTTG AGTGT GT AC AGGTGG AACGGGTC GTGG AC TTG AGGACC C C 
GGGGDKTHTCPPC PAPELLG 
GGACCGTCAGTTTTCCTCTTCCCCCCAAAACCCAAGGACACCCTCATGATCTCCCGGACC 

+ + 4- + + + 

CCTGGCAGTCAAAAGGAGAAGGGGGGTTTTGGGTTCCTGTGGGAGTACTAGAGGGCCTGG 
GPSVFLFPPKPKDTLMI SRT 
CCTGAGGTCACATGCGTGGTGGTGGACGTGAGCCACGAAGACCCTGAGGTCAAGTTCAAC 



180 



181 



GGACTCCAGTGTACGCACCACCACCTGCACTCGGTGCTTCTGGGACTCCAGTTCAAGTTG 

PEVTCVVVDVSHEDPEVKFN 

TGGTACGTGGACGGCGTGGAGGTGCATAATGCCAAGACAAAGCCGCGGGAGGAGCAGTAC 



■ + 240 



241 



ACCATGCACCTGCCGCACCTCCACGTATTACGGTTCTGTTTCGGCGCCCTCCTCGTCATG 

WYVDGVEVHNAKTKPREEQY 

AACAGCACGTACCGTGTGGTCAGCGTCCTCACCGTCCTGCACCAGGACTGGCTGAATGGC 



-+ 300 



301 



TTGTCGTGCATGGCACACCAGTCGCAGGAGTGGCAGGACGTGGTCCTGACCGACTTACCG 

NSTYRVVSVLTVLHQDWLNG 

AAGGAGTACAAGTGCAAGGTCTCCAACAAAGCCCTCCCAGCCCCCATCGAGAAAACCATC 



•+ 360 



361 



TTCCTCATGTTCACGTTCCAGAGGTTGTTTCGGGAGGGTCGGGGGTAGCTCTTTTGGTAG 
KEYKCKVSNK ALPAPIEKTI 
TCCAAAGCCAAAGGGCAGCCCCGAGAACCACAGGTGTACACCCTGCCCCCATCCCGGGAT 



+ 420 



421 



AGGTTTCGGTTTCCCGTCGGGGCTCTTGGTGTCCACATGTGGGACGGGGGTAGGGCCCTA 

SKAKGQPREPQVYTLPPiSRD 

GAGCTGACCAAGAACCAGGTCAGCCTGACCTGCCTGGTCAAAGGCTTCTATCCCAGCGAC 



■+ 480 



481 



CTCGACTGGTTCTTGGTCCAGTCGGACTGGACGGACCAGTTTCCGAAGATAGGGTCGCTG 
E LTKNQVSLTCLVKGFYPSD 
ATCGCCGTGGAGTGGGAGAGCAATGGGCAGCCGGAGAACAACTACAAGACCACGCCTCCC 



■+ 540 



541 



TAGCGGCACCTCACCCTCTCGTTACCCGTCGGCCTCTTGTTGATGTTCTGGTGCGGAGGG 
IAVEWESNGQPENNYKTTPP 



•+ 600 




GTGCTGGACTCCGACGGCTCCTTCTTCCTCTACAGCAAGCTCACCGTGGACAAGAGCAGG 
601 + + + + + + 660 

CACGACCTGAGGCTGCCGAGGAAGAAGGAGATGTCGTTCGAGTGGCACCTGTTCTCGTCC 
V LDSDGS FFLYSKLTVDKSR 

TGGCAGCAGGGGAACGTCTTCTCATGCTCCGTGATGCATGAGGCTCTGCACAACCACTAC 
661 + + + + + + 720 

ACCGTCGTCCCCTTGCAGAAGAGTACGAGGCACTACGTACTCCGAGACGTGTTGGTGATG 

WQQGNVFSCSVMHEALHNHY 

BamHI 
I 

ACGCAGAAGAGCCTCTCCCTGTCTCCGGGTAAATAATGGATCC 

721 + + + + 763 

TGCGTCTTCTC GG AG AGGG AC AG AGGC C C ATTT ATT AC C T AGG 



TQKSLSLSPGK 



1» 



f 



FIG. 2A 



FIG. 2B FIG. 2C 
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